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SOME NEW CANONICAL FORMS FOR POLYNOMIALS 

"(N ■ BRUCE REZNICK 

o 

CN ■ Abstract. We give some new canonical representations for forms over C For 

Vh ', example, a general binary quartic form can be written as the square of a quadratic 

form plus the fourth power of a linear form. A general cubic form in (xi, . . . , a;„) 
can be written uniquely as a sum of the cubes of linear forms iij{xi, ■ ■ ■ ,Xj), 1 < i < 
, 1^ ■ j ^ n. A general ternary quartic form is the sum of the square of a quadratic form 

p^ I and three fourth powers of linear forms. The methods are classical and elementary. 

o 

"<C ! 1. Introduction 

Let ifd(C") denote the N{n,d) = (""^^"^j -dimensional vector space of complex 
forms of degree d in n variables, or n-ary d-ic forms. One of the major accomplish- 
ments of 19th century algebra was the discovery of canonical forms for certain classes 
of n-ary (i-ics, especially as the sum of d-th power of linear forms. By a canoni- 
cal form we mean a polynomial expression F{t;x), t E C^i^A so that, for general 
CN ■ P G ifrf(C"), there exists t for which p{x) = F{t;x). In this paper, we present some 

^ . new canonical forms, whose main novelty is that they involve intermediate powers of 

i/^ [ forms of higher degree, or forms with a restricted set of monomials. (These variations 

are suggested by the study of Hilbert's 17th problem, as well as by a theorem of B. 
O ' Reichstein.) They are less susceptible to apolarity arguments than the traditional 

^ . canonical forms, and lead naturally to (mostly open) enumeration questions. 

To take a simple, yet familiar example, 

(1.1) F{h,t2, h; X, y) = {hx + hyf + {hyf 



c^ 



> 



'i 



is a canonical form for binary quadratic forms. By the usual completion of squares, 
p{x, y) = ax^ + 2bxy + cy^ = a{x + -yY + (c — — )y^, as long as a 7^ 0. Many of the 
examples in this paper can be viewed as attempts to generalize (II. ip . 

In 1851, Sylvester [381 EH] presented canonical expressions for binary forms. 

Theorem 1.1 (Sylvester's Theorem). 

(i) A general binary form p of odd degree 2s — 1 can be written as 



(1.2) p{x, y) = ^{ajX + /S^y) 



2s-l 
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2 BRUCE REZNICK 

(ii) A general binary form p of even degree 2s can be written as 



s 



;i.3) p{x, y) = Xx^' + V(a,x + /3jyf'. 






for some A G C. 



The somewhat unsatisfactory nature of the asymmetric summand in (11.3^ has been 
the inspiration for other canonical forms for binary forms of even degree. 

Another famihar canonical form is the generalization of (11.11) into the upper- 
triangular expression for quadratic forms, found by repeated completion of the square: 

Theorem 1.2. A general quadratic form p G iJ2(C") can be written as: 

n 

(1.4) p{xi, ...,Xn)= y^(tfc,fcXfc + tk,k+lXk+l H \-tk,nXny, tk,£ ^ C. 

k=l 

There are two ways to verify that a candidate expression F{t; x) is, in fact, a 
canonical form. One is the classical non-constructive method based on the existence 
of a point at which the Jacobian matrix has full rank. (See Corollary 12.31 and see 
Theorem 13.21 for the apolar version.) Lasker j23] attributes the underlying idea to 
Kronecker and Liiroth - see [13 p. 208]. 

Ideally, however, a canonical form can be derived constructively, and the number 
of different representations can thereby be determined. The convention in this paper 
will be that two representations are the same if they are equal, up to a permutation 
of like summands and with the identification of f'' and {(f)^ when (^ = 1. The 
representation in (II. 2p is unique in this sense, even though there are s! • (2s — 1)'^ 
different 2s-tuples (ai, /3i, . . . , a^, f3s) for which (II. 2p is valid. 

In addition to Theorem 11.11 another motivational example for this paper is a re- 
markable canonical form for cubic forms found by Reichstein [30j in 1987, which can 
be thought of as a "completion of the cube" . 

Theorem 1.3 (Reichstein). A general cubic p G H^^C^) can be written uniquely as 

n 

(1.5) p{Xu ...,Xn)=^ iliXi, ...,Xn)+ q{x^, 



, Xri 



where 4 G Hi{C') and q e H^{C 



fc=i 

n~2\ 



This is a canonical form, provided q is viewed as a t-linear combination of the 
monomials in (xa, . . . ,a;„); since A^(n, 3) = n^ + N{n — 2,3), the constant count is 
right. Iteration (see (16.11) ) gives p as a sum of roughly n^/4 cubes. The minimum 
from constant-counting, which is justified by the Alexander-Hirschowitz Theorem pQ, 
is roughly n^/6. We give Reichstein's constructive proof of Theorem 11.31 in section 
six. 

Here are some representative examples of the new canonical forms in this paper. 
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Theorem 1.4. A general cubic form p G //3(C") has a unique representation 

(1.6) p(xi, ...,Xn)= ^ ii{i,j},i^i H \- hi,j},jXjf, 

l<i<j<n 

where t^ijy^k G C. 

Theorem 1.5. yl general binary sextic p G Hq{C'^) can be written as p{x,y) = 
P{x,y) + g^{x,y), where f G Hs^C"^) is a cubic form and g G H2{€?) is a quadratic 
form. 

Theorem 11.41 has a constructive proof. Theorem 11.51 is in fact, a very special case 
of much deeper recent resuhs of VariUy-Alvarado. (See [12], especially Theorem 1.2 
and Remark 4.5, and section 1.2 of [13].) We include it because our proof, in the 
next section, is very short. 

Theorems 11.11 and 11.51 are both special cases of a more general class of canonical 
forms for Hd{C^), which is a corollary of [F, Thm. 4.4] (see Theorem 13. 4p . but not 
worked out explicitly there. 

Theorem 1.6. Suppose d > 1, {ij : 1 < j < m} is a fixed set of pairwise non- 
proportional linear forms, and suppose e^ \d,d>ei>--->er,l<k<r, and 

r 

(1.7) m + ^(efc + l) = rf + l. 

fc=i 

Then a general binary d-ic form p G Hdi'C?) can be written as 

m r 

(1-8) p(:^,t/) = ^t/?(x,i/) + 5^/f'=(x,t/), 

j=i fc=i 



where tj G C and fk G He,. (C 



k^" ^^ 



The condition Ck < d excludes the vacuous case m = 0,r = l,ei = d. If each e^ = 1 
and r = \_^^\ , then m = d + 1 — 2 \_^-^\ G {0, 1} and Theorem II .61 becomes Theorem 
ll.lt Theorem 1 1.5 1 is Theorem ll.6l in the special case (i = 6, m = 0, r = 2, ci = 3, 62 = 2. 
As an example of a canonical form which is unlikely to find a constructive proof: for 
a general p G Hs^iC^), there exist / G H42{C'^),g G H2s{C^) and h G -ffi2(C^) so that 
p = f"^ + g^ + h\ 

By taking (i = 2s, ei = 2, 62 = ■ ■ ■ = e^-i = 1 and ?7i = 0, in Theorem 11.61 we 
obtain an alternative to the dangling term " Ax^*" in (11.31) . 

Corollary 1.7. A general binary form p of even degree 2s can be written as 

s-l 

(1.9) p(x, y) = {aox^ + (3oxy + ^oV^ + ^("^0; + I3jyf'. 
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Cayley proved that, after an invertible linear change of variables {x,y) h-> [X,Y), 
a general binary quartic can be written as X^ + GXX'^Y^ + Y^. There are two natural 
ways to generalize this to higher even degree, and almost 100 years ago, Wakeford 
15] did both. 



Theorem 1.8 (Wakeford's Theorem). After an invertible linear change of variables, 
a general p G HdiC^) can be written so that the coefficient of each xf is 1 and the 
coefficient of each xf'^Xj is zero. 

There are N{n, d)—n'^ unmentioned monomials above, and when combined with the 
V? coefficients in the change of variables, the constant count is correct for a canonical 
form. Wakeford was also interested in knowing which sets of n{n — 1) monomials 
can be eliminated by a change of variables, and we are able to settle this for binary 
forms in Theorem 12.41 (Theorem 11.81 was independently discovered by Guazzone [H] 
in 1975, as an attempt to generalize the canonical form X^ + Y^ + Z^ + QXXYZ for 
H^i^C^). Babbage [2] subsequently observed that this can be proved by the Lasker- 
Wakeford Theorem, without noting that Wakeford had already done so in |15].) 

The second generalization of X^ + 6XX^Y^ + Y"^ will not be pursued here; see [HI 
Cor. 4.11]. A canonical form for binary forms of even degree 2s is given by 

s s 

(1.10) J2^l%x,y) + Xl[£l{x,y), ik{x,y) = a^x + (3ky. 

k=l k=l 

This construction is due to Sylvester ^\ for 2s = 4, 8. His methods failed for 2s = 6, 
but Wakeford was able to prove it in [H]. The full version of (ll.lOp is proved in 
im p. 408], where Wakeford notes that "the number of ways this reduction can be 
performed is interesting", citing "3,8,5" for 2s = 4,6,8. 

The non-trivial study of canonical forms was initiated by Clebsch's 1861 discovery 
([5], see e.g. p., pp.50-51] and [31, pp.59-60]) that, despite the fact that A^(3,4) = 
5 X A^(3, 1), a general ternary quartic cannot be written as a sum of five fourth powers 
of linear forms. This was early evidence that const ant- counting can fail. But A^(3, 4) 
is also equal to 1 x A'"(3, 2) + 3 x A^(3, 1), and ternary quartics do satisfy an alternative 
canonical form as a mixed sum of powers. 

Theorem 1.9. A general ternary quartic p G H^IC^) can be written as 

3 

(1.11) p{Xi,X2,X3) =q'^{Xi,X2,Xs) +^£fc(Xi,X2,X3), 



k=l 



where q G //2(C^) and 4 G Hi{C^). 



One might also consider polynomial maps F : S ^-^ Hd{C^), where 5* is an A^- 
dimensional subspace of some C*'''^. In the simplest case, for binary quadratic forms, 
observe that the coefficient of x"^ in 

(1.12) itix + t2y)^ + ittix + t-iyf, 
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is 0, SO f ll.l2p is not canonical. This is essentially the only kind of exception. 

Theorem 1.10. Suppose (01,02,03,04) G C*^, and it is not true that 03 = eoi and 
04 = eo2 for e G {±i}- Then for general p G H2{'C?), there exists (ti, ^2,^3,1^4) £ C*^ 
satisfying Yli=i'^j^j ~ ^'^'^ such that 

(1.13) p{x, y) = (tix + t2y)^ + (tax + t^y)^. 

In the exceptional case, there exists {xo,yo) so that for all feasible choices of tj, 
Pixo,yo) =0. 

Another alternative version of (II. 3p is the following conjecture, which can be veri- 
fied up to degree 8. 

Conjecture 1.11. A general binary form p of even degree 2s can be written as 

s+l s+1 

(1.14) p{x-,y) = 2,{oLj^ + (^jvY^ ) where /^(a^ + /3j) = 0. 

i=i i=i 

Here is an outline of the paper. In section two, we introduce notation and defini- 
tions. The definition of canonical form is the classical one and roughly parallels that 
in Ehrenborg-Rota [8], an important updating of this subject about 20 years ago. 
Our point of view is considerably more elementary in many respects than [8], but 
uses the traditional criterion: A polynomial map F : C^ i— )■ Hd{C^) is a canonical 
form if a general p G Hd{C"') is in the range; this occurs if and only if there is at least 
one point u G C^ so that {^(u)} spans ifrf(C"). (See Corollary 12.31 ) This leads to 
immediate non-constructive proofs of Theorems ll. 2(11. 5^11. 8l and ll.9[ and a somewhat 
more complicated proof of Theorem 12. 4[ which answers Wakeford's question about 
missing monomials for binary forms. 

In section three, we discuss classical apolarity and its implications for canonical 
forms. (Apolarity methods become more complicated when a component of a canon- 
ical form comes from a restricted set of monomials.) A generalization of the classical 
Fundamental Theorem of Apolarity from [33] allows us to identify a class of bases 
for Hd{C"') which give a non-constructive proof of Theorem II. 6 [ and hence Theorem 



11.11 We also present Sylvester's Algorithm, Theorem 13. 8| allowing for a constructive 
proof of Theorem 11.11 We conclude with a brief summary of connections with the 
theorems of Alexander-Hirschowitz and recent work on the rank of forms. 

In section four we discuss some special cases of Theorem 1 1.6 1 Sylvester's Algorithm 
is used in constructive proof of Theorem 11.61 when 0^ = 1 , in which case the repre- 
sentation is unique. We give some other constructive proofs for d < 4, and present 
numerical evidence regarding the number of representations in Corollary 11.71 and a 
few other cases. Using elementary number theory, we show that, for each r, there are 
only finitely many canonical forms (II. 8p with m = 0, and, up to degree A^, there are 
A^ + 0{N^'^) such canonical forms in which the e^'s are equal. 

Section five discusses some familiar results on sums of two squares of binary forms 
and canonical representations of quadratic forms as a sum of squares of linear forms. 
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This includes a constructive proof of Theorem 11.21 which provides the groundwork 
for the proof of Theorem ll.4[ Constant-counting has long been recognized as not 
fully applicable to sums of squares. We also give a short proof of a canonical form 
which illustrates the classical result that a general ternary quartic is the sum of three 
squares of quadratic forms. 

In section six, we turn to forms in more than two variables and low degree, give 
constructive proofs of Theorems 11.31 and II. 4[ as well as the non-canonical Theorem 
16.21 which shows that every cubic in H^{C'^) is a sum of at most "'■"^"'" ' cubes of 
linear forms. Theorem 11.31 can be "lifted" to an ungainly canonical form for quartics 
as a sum of fourth powers (see Corollary 16.31) . but not further to quintics. Number 
theoretic considerations rule out a Reichstein-type canonical form for quartics in 12 
variables; see Theorem 16.41 for other instances of this phenomenon. 

In section seven, we offer a preliminary discussion of canonical forms in which the 
domain of a polynomial map F : C^''^ H- Hd{'C"') is restricted to an A^- dimensional 
subspace of C^, of which Theorem 11.101 and Conjecture 11.111 are examples. 

The greatest debt of the author is due Richard Ehrenborg and Gian-Carlo Rota 
for writing [8]. Thanks to Dave Anderson and Julianna Tymoczko for organizing 
the Special Session on Geometric Commutative Algebra and Applications at the 
March 2011 AMS Sectional Meeting in Iowa City, to Lek-Heng Lim for organizing the 
Minisymposium on Tensor Rank at the October 2011 SIAM Conference on Applied 
Algebraic Geometry in Raleigh and to Eugene Mukhin for organizing ALGEC0M5 
in Indianapolis in October 2011. Invitations to speak at these conferences provided 
an opportunity to present preliminary versions of this material. I am indebted to 
Tony Varilly-Alvarado for extremely helpful correspondence. I also want to thank 
T. Y. Lam, Hal Schenck, Bernd Sturmfels and Doron Zeilberger for their assistance. 
Special thanks go to the Center for Advanced Study at UIUC, where the author was 
an Associate in the Fall 2011 semester, and thereby free of teaching responsibilities. 

2. Canonical forms 
Let X(n, d) denote the index set of monomials in Hd{C"'): 

(2.1) X(n,rf) = {(«!,..., 20:0<2fcGZ, J2'k = d}. 

k 

Let X* = x^i ■ ■ ■ ^^n ^'^d c(i) = T^-f denote the multinomial coefficient. If p G -ffrf(C"), 
then we write 

(2.2) p(a;i,...,x„) = ^ c(z)a(p; i)x\ a(p;i)eC. 

iGl(n,d) 

We say that two forms are distinct if they are non-proportional, and a set of forms 
is honest if the forms are pairwise distinct. For later reference, recall Biermann's 
Theorem; see [HI p. 31]. 
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Theorem 2.1 (Biermann's Theorem). If p G Hd^C"") and p ^ 0, then there exists 
i G X(n, d) so that p{i) ^ 0. 

The easy verification of whether a formula is a canonical form for Hd{C"') relies on 
a crucial alternative. A self-contained accessible proof is in [HI Thm. 2.4], for which 
Ehrenborg and Rota thank M. Artin and A. Mattuck. For further discussion of the 
underlying algebraic geometry, see Section 9.5 in Cox, Little and O'Shea [7j. 

Theorem 2.2. Suppose M > N and F : C^^ — )■ C^ is a polynomial map; that is, 

F{ti, . . . , tpj) = (/i(ti, . . . , ^m), • • • , fwiti, • • • , ^m)) 

where each fj e C[ti, . . . ,tM]- Then either (i) or (ii) holds: 

(i) The N polynomials {fj : 1 < j < A^} are algebraically dependent and F(C^^) 
lies in some non-trivial variety {P = 0} in C^ . 

(ii) The N polynomials {fj : 1 < j < A^} are algebraically independent and F{C^'^) 
is dense in C^. 

The second case occurs if and only there is a point u G C^^ at which the Jacobian 
matrix \_-^{u)\ has full rank. 

When M = N = N{n, d), we may interpret such an F as a map from C'^ to Hd{C") 
by indexing I{n, d) as {ik : 1 < A; < A^} and making the interpretation in an abuse 
of notation that 

AT 

(2.3) F{t-x) = J2<'k)fk{ti . . .,tN)x'K 

k=l 

Definition. A canonical form for Hd{C"') is any polynomial map F : C^ i— ;■ Hd{'C"') 
in which F satisfies Theorem 12. 2( ii). 

That is, F is a canonical form if and only if, for a general p G Hd{C"'), there exists 
t G C^ so that p{x) = F{t;x). In the rare cases where F is surjective, we say that 
the canonical form is universal. By translating the definitions and using (12. ip and 
(12.31) . we obtain an immediate corollary of Theorem 12.21 

Corollary 2.3. The polynomial map F : C^ H- Hd{C"') is a canonical form if and 
only if there exists m G C" so that {^{u)} spans ifrf(C"). 

We shall let J := J{F; u) denote the span of the forms {|^(m)}. In any particular 
case, the determination of whether J = Hd{C^) amounts to the computation of the 
determinant of an N{n, d) x N{n, d) matrix. As much as possible in this paper, we 
give proofs which can be checked by hand, by making a judicious choice of u and 
ordering of the monomials in Hd{C^), showing sequentially that they all lie in J. 

Classically, the use of the term "canonical form" has been limited to cases in which 
F{t;x) has a natural interpretation as a combination of forms in Hd{C^), such as a 
sum of powers of linear forms, or as a result of a linear change of variables. It seems 
odd that canonical forms are perceived as rare, since a "general" polynomial map 
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from C^ ^-)• Hd^C"") is a canonical form. (This is an observation which goes back at 
least to [SZ1-) For example, if {fj{x)} is a basis for Ha{C"'), then 

N 

(2.4) F{t;x) = Y,t,f,{x) 

should be (but usually isn't) considered a canonical form. In particular, 02. 2p with 
fj{x) = c{ij)x^^ is itself a canonical form. 

The following computation will occur repeatedly. If es = d, then 

(2.5) g = Yl ^.^^^ ^ §- = ^^'^^'"'- 

If g is specialized to be a monomial, then all these partials will also be monomials. 

N on- constructive proof of Theorem M.'A Given 01. 4p . let 

n n 

^k{x) = ^ tk,mXm, F{x) = ^£^(x). 
m=k k=l 

Then ^§^ = 2x^4- Set tum = h.m, so that 4 = Xk and ^§^ = Ix^x^- Since 
1 < k < m < n, all monomials from if2(C") appear in J. D 

Non- constructive proof of Theorem \1.5[ Suppose 

P{x,y) = f'^{x,y)+g^{x,y) : 

f(x, y) = tix^ + t2X^y + t-ixy"^ + t^y^, g{x, y) = t^x^ + t^xy + t??/^. 

Then by 02. Sp . the partials with respect to the tj's are: 

2xV, 2x^1//, 2X1/V, 2|/V; Sx^^?^^ ?>xyg\ ?>y^g\ 

Upon specializing at / = x^,^' = y^, these become: 

2x^ 2x^y, 2x^y^ 2x^1/^; 3x2l/^ 3xy^ , 3y^ 

It is then evident that J = Hq{C'^). U 

Non- constructive proof of Theorem M.^ Let C C X(n, c?) consist of all n-tuples except 
the permutations of (c?, 0, . . . , 0) and (c? — 1, 1, . . . , 0) and let Xj = ^"^^^ aijXj. The 

assertion is that, with the {N{n, d) — n — (^)) + n"^ = N{n, d) parameters t(, and ajj, 

(2.7) Y.Xf + Y,t^X' 

is a canonical form. Evaluate the partials at the point where Xj = Xj and t£ = 0: 
they are dxjX^~^ (for ajj) and x^ (for t^). Taking 1 < i,j < n and £ G £, we see that 
J contains all monomials in Hd{C^). □ 
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As a special case (used later in Theorem I4.6p . we obtain the familiar result that 
after appropriate linear changes of variable, a general binary quartic may be written 
as x^ + 6Xx^y^ + y'^. It is classically known (see [9l §211]) that for a given general 
quartic, there are six different values of A: ±A, ±^i^, ±yz^. We do not know any 
other quantitative results about the number of different values for the t^'s which 
appear in (12. 7p . 

Wakeford asserts that Theorem ll.Sl is also true with xf~^Xj replaced by xf~^x^j (ev- 
idently when r 7^ |), but his proof seems sketchy. He also gives necessary conditions 
for sets of n{n — 1) monomials which may be omitted, and these are hard to follow 
as well. Below, we answer his question in the binary case: in the only two excluded 
cases below, (12. 8 p has a square factor, and so cannot be canonical. 

Theorem 2.4. Let B = (rrii, m2, fii, n2) he four distinct integers in {0, . . . , d} so that 
{mi, m2} 7^ {0, 1}, {d — 1, rf}. Then, after an invertihle linear change of variable, a 
general binary form p of degree d can be written as 

(2.8) p{x, y) = x'^-'^'y'^^ + a;'^-"^^"^ + ^ t^x'^'^y^ 

for some {t^} C C 

Proof. Writing (x, y) i— ;■ (aix + a2y, oi^x + a^j) := (X, Y), we have 

(2.9) F = X'^-"iF"i + x^-"2y"2 ^ Y^ tkX^-^Y^. 

Evaluate the partials of (12.90 at (ai, 02, as, 04) = (1, 0, 0, 1) (so X = x, F = y) and 
tfc = 1 (note the difference with the previous proof, in which tk = 0). The d — 3 
partials with respect to the t^'s are simply x'^~^y^, k ^ B, so these are in J. Further, 

(2.10) f:= E (^-i)a:^-v, ^= E ^-'-'y'- 

Since most monomials used in (I2.10p are already in J, it follows that J also contains 

(2.11) {d - ni)x^-"iy"^ + {d- r^2)x'^-"2y"^ nix'^-"^?/"^ + n2x'^-"^l/"^ 

and since {d — ni)n2 ^ [d — n2)ni, (12. lip implies that x'^~"^y"^ G J for j = 1,2. 
To this point, we have shown that J contains all monomials from ifrf(C^) except for 
x'^-'^jy"^j ^ where mi < m2- The two remaining partial derivatives are 

(2.12) f - E (<'-.>^--'.-. 1^- E ^^-'*'y'-'- 

and so J contains as well the forms in (I2.12p of the shape cix'^~"^^y"^^ + C2x'^~"^^y"^'^. 
We need to distinguish a number of cases. If mi = 0, m2 = d, then these forms 
are y'^,x'^. If mi = and 2 < m,2 < d — 1, then these forms are {d — m2)x'^~"^^y"^^ 
and x'^ + {m2 + l)x'^~'^^y"^^ , and similarly when 1 < nii < d — 2 and m2 = d. In 
the remaining cases, 1 < rrii < m2 < d — 1. If m2 = rrii + 1, then these forms are 
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(d— (nil — l))x'^~™'^y™'^ and (m2 + l)x'^~'^^y"^^. Finally, if m2 > nii + l, then all four 
terms appear, and the forms are 

{d-mi + l)x'^-"''y"'' + {d-m2 + l)x^-"^2y™% 

(mi + l)x'^-"^y™^ + (m2 + l)x^-"^^l/™^ 

In each of the cases, linear combinations of the forms produce the missing monomials, 
so J = Hd{C^). n 

Remark. By writing p{x,y) = Ylki-^ + (^kV), it follows from Theorem 11.81 that, for a 
general set of d complex numbers a^, there exists a Mobius transformation T so that 

d d d 

(2.14) 5^T(a,)=0, E^(i) = 0' U^M = ^- 

k=l fc=l fc=l 

Non- constructive proof of Theorem \1.9i Write (II. lip as F{x;t), where 

q{Xi,X2, X3) = tixf + t2x\ + ^3X3 + 1(4X1X2 + t^XiX'i + ^6X2X3, 

4(a;i, X2, X3) = tfeixi + tfc2a;2 + ^sa^s- 

Evaluate the partials at: q = X1X2 + X1X3 + X2X3 and (4, 4, ^3) = (xi,X2,X3). Then 
^f- = 4x£X^, so xf^x^Xj G J; since |p = 2xfg = 2xi(xiX2 + X1X3 + X2X3), it follows 

that xfx2X3 G J, similarly, by considering |p and |p, it follows that xiX2X3,xiX2x| 
are in J. Finally, |p = 2xiX2g = 2xiX2(xiX2 + X1X3 + X2X3), and so now x^x^ G J. 
Similarly, by considering |^ and |^, it follows that xlx^jX^x^ are also in J, and this 
accounts for all monomials in H4{C^). D 

Other applications of Corollary 12. 3 1 to canonical forms can be found in |8], [37] and 
im pp. 265-269]. 

3. Apolarity 

Using the notation of (12. ip and (12. 2p . for p, q G Hd{'C"'), define the following bilinear 
form: 

(3.1) [P,q]= Yl c{i)a{p]i)a{q]i). 

iGl{n,d) 

Recall two basic notations. For a G C", define {a-Y G Hd^C"") by 

d 



c(i)a''x\ 



(3.2) (a-Yix) = {a ■ xf = (fl "^'^i) = E 

Define the differential operator f{D) for / G He{C'"') in the usual way by 

(3.3) f{D)= J2 cW«(/;0(afr)"---( 



d ''" 



iGX(n,e) 
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It follows immediately that for a G C", 

(3.4) [P,(«-)1= Yl c{z)a{p;z)a'=p{a), 

li i y^ j E X(n, d), then ik > jk for some k, so D^x^ = 0; otherwise D^x^ = Ylki'^kV- ~ 
d\/c{i). Suppose p, g G Hd{C"'). Bilinearity and (13. 3p imply the classical result that 

P{D)q= ^ c{i)a{p]i)D'{ ^ c{j)a{q] j)xn = 

ieX{n,d) \'eX(n,d) ^ 

(3.5) Yl Yl c(i)c(j>(p;i)a(g;j)DV = Y c{i)c{i)a{p;i)a{q;i)D'x' 

iGl{n,d) j£X(n,d) i&X{n,d) 

= Y c(«)^a(p;i)a(g;i)— - = rf![p,g] = rf![g,p] = g(D)p. 
, s c(«) 

Definition. If p G iJ(i(C") and q G He{C^), then p and q are apolar if p{D)q = 
q{D)p = 0. 

Note that if c? = e, then p and q are apolar if and only if [p, q] = and ii d > e, 
say, then the equation p{D)q = is automatic, so only q{D)p = need be checked. 
By f l3.4p . p is apolar to {a-Y if and only if p{a) = 0. 

The following lemma is both essential and trivial. 

Lemma 3.1. Suppose X = span{{hj}) C Hd{C"'). Then X = Hd{C"') if and only if 
there is no Q ^ p E Hct,{<C^) which is apolar to each of the hj 's. 

From this point of view. Theorem 13.21 is a direct consequence of Corollary 12.31 

Theorem 3.2 (Lasker-Wakeford) . If F : C^ -^ //^(C"'), then F is a canonical form 
if and only if there is a point u so that there is no non-zero form q G Hd{'C^) which 
is apolar to all N forms {^(w)}. 

The attribution "Lasker-Wakeford" (for [23lll5]) is taken from [H]: H. W. Turnbull 
(1885-1961) was one of the last practicing invariant theorists who had been trained 
in the pre-Hilbert approach, see [TO^, pp. 231-232]. (His text [H] is a Rosetta Stone for 
understanding the 19th century approach to algebra in more modern terminology.) 
Turnbull referred to Theorem l3.2l as "paradoxical and very curious". E. Lasker (1868- 
1941) received his Ph.D. under M. Noether at Gottingen in 1902. He is probably 
better known for being the world chess champion for 27 years (1894-1921), spanning 
the life of E. K. Wakeford (1894-1916). J. H. Grace, Wakeford's professor at Oxford, 
edited the second half of his thesis into the article [45j and also wrote a memorial 
article [13] for him in 1918: 

"He [EKW] was slightly wounded early in 1916, and soon after coming 
home was busy again with Canonical Forms.... [H]e discovered a paper 
of Hubert's which contained the very theorem he had long been in 
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want of - first vaguely, and later quite definitely. This was in March; 
April found him, full of the most joyous and reverential admiration 
for the great German master, working away in fearful haste to finish 
the dissertation ... He returned to the front in June and was killed in 
July.... He only needed a chance, and he never got it." 

The following properties are easily established; see, e.g., [211 E3j for proofs. 

Theorem 3.3. 

(i) Iffe i/e(C"), g e Hd^,{C^) andpe HdiC'), then 

(3.6) d\[fgM = {fg){D)p = f{D)g{D)p = e\[f,g{D)p]. 

Thus, p is apolar to every multiple of g in H(i{C^) if and only if p and g are apolar. 
(ii) If p e HdiC"), then ^-§^{a) = [p,Xj{ay-^]. Thus, p is apolar to [a-Y^^ if 

and only if p is singular at a. More generally, p is apolar to {a-Y~^ if and only if p 

vanishes to e-th order at a. 

(ill) Ife<d andge i7rf_,(C"), then g{D){a-Y = f ^(a)(a-)'. 

By (12.51) . if h{x) = 'Ylit^xine)^^^^ ^^"^ F{t;x) contains k^ as a summand, then 
1^ = sx^h'^~^, and Up is apolar to each |^, then it is apolar to h^~^. It is critical to 
note that this observation fails in case h is defined as a sum from a restricted set of 
monomials. 

We are now able to give a short proof of the "Second main theorem on apolarity" 
from [8], which was not concerned with preserving the const ant- count. 

Theorem 3.4. Suppose J£ = {je^i, . . . ,je,m), 1 < ^ < ''"; are m-tuples of non-negative 
integers, and suppose positive integers dk, 1 < k < m, and d are chosen so that 



(3.7) ur-= d- ^je,kdk > 0. 

fc=i 

Fix forms qi E i/„^(C") and for fk G ifd^(C"), define 

r 

(3.8) F(A,..., /J = ^g,(^)/i" ■■■/-'"■ 

Let Fj := ^. Then a general p G Hd{C^) can be written as (13. Sp if and only if there 

exists a specific f = (fk) so that no non-zero p G Hdi'C^) is apolar to each Fj{f), 
1 < j < m. If, in addition, 

m 

(3.9) 5^iV(n,4) = iV(n,d), 



fc=i 



then (13.81) is a canonical form. 
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Proof. Let 

(3.10) fj{x)= Yl hvx'^- 

By Theorem l2.2[ (13. 7p and Lemma [3?H fl3.8p represents general p G ifrf(C") if and only 
if there is some / so that there is no non-zero form in p G Ha{C"') which is apolar to 
each -§f—{f) = dkX^''Fj(f), or by Theorem I3.3( i). to each Fj(f). The constant count 
is checked by (jM]). D 



By Theorem I3.3( ii) and Theorem 13.41 F = Yl^k=ii'^k-) is a canonical form if 
and only if there exist r points ak ^ C" at which no non-zero form p G Hd{'C^) 
is singular. This result is classical. A particularly deep result of Alexander and 
Hirschowitz pLj from the early 1990s states that a general form in if^(C"), rf > 3, 
may be written as a sum of \^N{n, d)] d-ih. powers of linear forms, except when 
{n,d) = (5, 3), (3,4), (4,4), (5,4), when an extra summand is needed. (For much 
more on this, see ^ Lecture 7], [ISl Cor.1.62], [21, Ch.l5] and [291 Thm.0.2]; for 
a brief exposition of the proof, see [211 Ch.l5].) These references also discuss the 
exceptional examples, which were all known in the 19th century. The expression of 
forms as a sum of powers of forms is currently a very active area of interest; see the 
references above as well as [3], [11] and [22] . 

The Fundamental Theorem of Apolarity (see [33] for a history) states that if / is 
irreducible and p G Hfi{C^), then / and p are apolar if and only if p can be written 
as a sum of terms of the form {a-Y, where /(%) = 0. This was generalized in 



Theorem 3.5. [33, Thm. 4.1] Suppose q G ife(C") factors as YYj=i Qj ^ ^'^^^ ^ product 
of distinct irreducible factors and suppose p G Hd{C"'). Then q{D)p = if and only 
if there exist ajk C {qj{a) = 0}, and (pjk G iJmj_i(C") such that 



P{x) = Y^ ^0jfc(x)(ajfc ■ xY 



\d— (rrij — 1) 



The application of apolarity to binary forms is particularly simple, because zeros 
correspond to factors. li e = d + 1, then q{D)p = for every p G ifd(C"), and we 
obtain the following result, also found in [HI Thm.4.5]. 



rrij 



Corollary 3.6. Suppose {ajX + Pjy : 1 < j < r} is honest and suppose Y7j=i 
d + 1. Then the following set is a basis for HalC^): 

(3.11) S= {x^y'^^-^-\f3jX-ajyy~'^^+^ ■.0<k<mj-l, 1 < m^- < r} . 

Proof, li p is apolar to each term in 03. lip , then {ajX + fSjy)™-^ \ p by Theorem 13. 3( ii). 
Thus p = by degree considerations, and S has d + 1 elements, so it is a basis. D 

If each rrij = 1, then Corollary 13.61 states that an honest set S = {{djX + PjyY} of 
d+1 forms is a basis for Hd{C^). This is easily proved directly, since the representation 
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of S with respect to the basis {( .)x'''~^y^}, [a'^~^l3j], has Vandermonde determinant 

(3.12) H («,/?,■ -«,-A). 

l<i<j<n 

Each product in (13 .12^ is non-zero because {{ajX + (3jyy} is honest. One imphcation 
of this independence is found in [35l Cor. 4. 3]. 

Lemma 3.7. Ifp{x,y) G Hd{C^) has two honest representations 

m n 

(3.13) p{x,y) = 5^(aiX + Ay)'^ = ^(Tj-x + 6,yY 

and m + n < d + 1, then the representations are permutations of each other. 

Proof. If fl3.13|) holds, then {(a^x + [iiyY, {^jX + 5jyY} is hnearly dependent, which 
is impossible unless the dependence is trivial. D 

It follows immediately from Lemma [3.71 that the representations (11. 2p and (II. 3p . 
if they exist for p, are unique. When n > 3, the linear dependence of a set {{oij-Y} 
depends on the geometry of the points as well as the number (see the discussion of 
Serret's Theorem in [SH p.29] .) Even for powers of binary forms of degree e > 2, there 
are singular cases. It is not hard to show that a general set of {2k + 1) A;-th powers of 
quadratic forms is linearly independent; however, for example, (x^ — y'^Y + i'^^vY — 
(x^ + y'^Y- For much more on this, see [36] . 

Non- constructive proof of Theorem \l.(A For 1 < /c < r, write 



efc 



fk{x,y) = ^t 



k.x^^'V 



i=0 



By Corollary 12.31 and (12.51) . (11.81) is a canonical form in the variables {tj, tk/} provided 
there is a point at which the partials 

{i"^, 1 < J < m} U {x'^^^y/f ''="\ 1 < £ < Cfc, l<k<r} 

span Hd{C^). Let fk = i'j!', where {ii, . . . , im, h, ■ ■ ■ , C-t} is chosen to be honest. Then 
by (II. 7p . the desired assertion follows immediately from Corollary 13.61 D 

We present now Sylvester's Algorithm. For modern discussions of this, along with 
Gundelfinger's generalization [15], which is not included here, see [201 §5], [T7].[T8]. 
[191, [33] and [35]. 



Theorem 3.8 (Sylvester's Algorithm). Let 



i=o 
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ctx"" y 



W^PjX-ajV). 



he a given binary form and suppose {ajX + Pjy} is an honest set. Let 

r 

h{x,y) = ^1 
Then there exist Afc G C so that 

r 

if and only if 



fe=i 



(3.14) 






\ 



\ 



ad- 



O-d-r+l 



O-r+l 



ad j 



/co\ 

\crl 



/o\ 



Vo/ 



Theorem 13.81 can be put in the context of our previous discussion. Let Ar{p) 
denote the (c? — r + 1) x (r + 1) Hankel matrix on the left-hand side of fl3.14p . If 
^(-^) — YVj=iil^j§ — ^j^)i then a direct computation shows that 



(3.15) 



h{D)p = J2 



d\ 



m=0 



{d 



mjimi 



' d-r 



h+m^i 



X 



d—r~ 



mm 



It follows from (I3.15P that the coefficients of h{D)p are thus, up to multiple, the rows 
of the matrix product, so (I3.14p is equivalent to h{D)p = 0. In this way. Theorem 
3.81 follows from Theorem 13.51 Sylvester's algorithm can also be visualized as seeking 
constant-coefficient linear recurrences satisfied by {ak} and looking for the shortest 
one whose characteristic equation has distinct roots; this is the proof given in [35] . 
In this case, Gundelfinger's results handle the case when the roots are not distinct. 



Constructive proof of Theorem \l.l\ Suppose c? = 2s — 1 is odd. The matrix Agip) is 
s X (s-fl) and has a non-trivial null- vector. The corresponding h (which can be given 
in terms of the coefficients of p) has distinct factors unless its discriminant vanishes. 
Thus for general p G if2s-i(C^), Theorem 13.81 gives p as a sum of s (2s — l)-st powers 
of linear forms. 

\i d = 2s, the matrix As{p) is square, and if p is a sum of s 2s-th powers, then 
det As{p) = 0. Conversely, if detAs{p) = and the corresponding h has distinct 
factors (which is generally true), then p is a sum of s 2s-th powers. If Mi and M2 are 
two square matrices and rank(M2) = k, then det(Mi + AM2) is a polynomial in A of 
degree k. In particular, if g = {ax + (3y)'^^, then rank(ifs(g)) = 1. Thus, in general, 
there is a unique value of A and some matrix M so that = det As{p— X{ax+ /3y)'^'^) = 
det As{p) — A det M. (When ax + Py = x, M is the (l,l)-cofactor of As{p).) In the 
special case ax + (3y = x, this proves Theorem ll.l( ii). The same argument shows 
that for general q G H2s{C^), there exist s + 1 values of A so that p — Xq is a. sum of 
s 2s-th powers. D 
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In 1869, Sylvester [30] recalled his discovery of this algorithm and its consequences. 
"I discovered and developed the whole theory of canonical binary forms 
for odd degrees, and, as far as yet made out, for even degrees too, at 
one evening sitting, with a decanter of port wine to sustain nature's 
flagging energies, in a back office in Lincoln's Inn Fields. The work 
was done, and well done, but at the usual cost of racking thought — a 
brain on fire, and feet feeling, or feelingless, as if plunged in an ice-pail. 
That night we slept no more." 

Example 3.1. This example of Sylvester's algorithm will be used in Example 4.1. Let 
p{x, y) = 2x^+3x'^y-21xy^ -Aly^ = {l)-2x^+ (f) ■ 1 x^y+ Q ■ (-7) xy^+ g) ■ (-41) y^ 
Since 

■ -' -) (f) ■ (; 

we have h{x, y) = 6x'^ — 5xy + y'^ = (2x — y){3x — y). It now follows that p{x, y) = 
Ai(x + 2yY + X2{x + 3yY, and a simple computation shows that Ai = 5, A2 = —3. 

Lemma [3.11 when applied to Theorem 12.11 yields the following corollary. 

Corollary 3.9. A basis for Hd{C^) is given by {{i-Y : i G I{n,d)}. 

This in turn gives a weak version of the Alexander- Hirschowitz Theorem, 

Corollary 3.10. A general form in HdiC^) is a sum of N{n,d—1) = "^^_j^ ■ ^N{n, d) 
d-th powers of linear forms. 

Proof. Consider the sum 

N(n4-l) 

y^ {tti,lXi H h ti-aXnY, 

1=1 

and apply Corollary 12.31 with t^ specialized to i^ G I{n,d — 1). Then J contains 
Xk{irY~^ for each k^C and hence XkHd-i{C"') C J for each k, so J = Hd{C"'). □ 

4. Some new binary canonical forms 

This section is devoted to special cases of Theorem 11.61 First, if e^ = 1, we give a 
constructive proof showing uniqueness, which gives a kind of interpolation between 
Sylvester's Theorem and the representations of Hd{C^) by (12.41) with a fixed basis 
consisting of d-th powers, as in Corollary 13.61 

Corollary 4.1. Suppose d > 1, and {ij{x,y) = ajX + f3jy} is a fixed honest set of 
m = d + 1 — 2r linear forms. Then a general binary d-ic form p G Hd{C^) can be 
written uniquely as 

m r 

(4.1) p{x, y) = J2 ^j("j^ + f^^y^ + 5Z(^'^i^ + ^'^22/)'^- 

j=i fc=i 
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for some tk G C. 
Proof. Let 

m 

f{x,y) = Y[{/3jX-ajy). 
i=i 
Then f{D)p has degree d — m = 2r — 1 and by Theorem 13.81 generally has a unique 
representation as a sum of r 2r — 1-st powers of linear forms, say 

r 

(4.2) f{D)p = Y,inkix + Uk2y?'-'. 

k=l 

Further, it is generally true that f{uki,Uk2) 7^ 0. Let 

.^ 3^ (x )= '^^^~^^' V (^fci^ + ^k2yY 

It follows from Theorem I3.3( iii). ( 14. 2 p and fl4.3p that f{D)p = f{D)q. Since / has 
distinct factors, it then follows from Theorem 13.81 that there exist tj G C so that 

m 

P{x,y) -q{x,y) = ^tj{ajX + ^jyy. 
Conversely, suppose p has two different representations: 

m r m r 

(4.4) Yl ^i^i(^' y) + Y^^tkix + h2yy = J2 ^/j (^' y) + Y^^tkix + ik2yy. 

j=l k=l j=l k=l 

By combining the first sum on each side, fl4.4p becomes a linear dependence with m + 
2r = d + 1 summands, which by Lemma [3771 must be trivial; thus, the representations 
in (14. 4p are essentially the same. D 

Example 4.1. Let ii{x, y) = x + y and ^2(2;, y) = —x + 3y and let 

p{x, y) = -x^ + Ibx^y - ITOx^y^ + 390x^2/^ - SOSx^y^ + 483|/l 
In an application of the last proof, /(x, y) = {x — y){3x + y) = Sx^ — 2xy — y^, and 

3—^ - 27^^ - tt4 = leOx^ + 240x^y - 1680xy2 - 3280yl 

Example 13.11 implies that this expression equals 400(x + 2yY — 240(x + 3y)^. Since 
/(I, 2) = -5 and /(I, 3) = -12, it follows that 

p{x,y) = 

3! -400 ,^ , ^^,,5 , 3! -(-240),^ , ^^,,5 , ^ ,^ , ^,,5 , ^ , ^ , ^^,,5 



(x + 2yy + ' \ ; (x + 3?/)^ + ti(x + y)^ + t2i-x + SyY 



5! -(-5)' ^' 5! -(-12) 

-4(x + 2y)^ + (x + 3yf + ti(x + y)^ + t2(-a; + Syf 

and it can be readily be computed that ti = | and ^2 = |. 
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If each Cfc = 2 in Theorem 11.61 and m is as small as possible, then we obtain an 
analogue of Sylvester's Theorem for forms of even degree. 

Corollary 4.2. 

(i) A general binary form of degree d = 6s can be written as 

2s 

(4.5) xx^' + J2(^j^'^ + i^j^y + ^jyy 

for some A G C. 

(a) A general binary form of degree d = 6s + 2 can be written as 

2s+l 

(4.6) J2ic^,^' + P,^y + ijy'?'^' ■ 

(Hi) A general binary form of degree d = 6s + 4 can be written as 

2s+l 

(4.7) Aix6^+^ + X^y''^' + ^(«,x2 + (3,xy + i.y^f'^^ 

i=i 

for some Aj G C. 

We have not been able to find an analogue to Sylvester's Algorithm for determining 
the representations (14. 5p . (14.61) . (14.71) in Corollary 14. 2[ In the linear case, (ax + PyY 
is killed by (3^ — a-^, and two operators of this shape commute. Although each 

{ax'^ + 2(3xy+'yy'^Y is killed by the non-constant-coefficient {l3x+'yy)-^ — {ax+(3y)-^, 
two operators of this kind do not usually commute. The smallest constant-coefficient 
differential operator which kills (ax^ + 2Pxy + ^y'^Y has degree d+1; the product of 
any two of these would kill every form of degree 2d and so provide no information. 

Let us say that (II. 8p is a neat canonical form if m = 0, and of Sylvester-type if it is 
neat and if e^ = e for 1 < k < r. Counting the numbers of neat and Sylvester-type 
canonical forms leads to some number theory. The first lemma is standard. 

Lemma 4.3. Given < - G Q and < n G N, there exist only finitely many choices 
of rrij G Z, < mi < m2 ■ ■ ■ < mn, such that - = Yl^=i '^■ 

Proof. If n = 2, then - > ^ > ^ implies that there are finitely many integral 

choices for mi, each of which determines m2 = (- —)~^- Supposing the lemma 

valid for n — 1, we have -> — >—, and each choice of mi implies the equation 
- — ;^ = YTj=2 '^- "^^^^ ^^^ finitely many solutions by the induction hypothesis. D 

Theorem 4.4. For fixed value ofr, there are only finitely many neat canonical forms 
fll.SD with r summands. 
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Proof. Suppose m = in Theorem 11.61 Write d = ekmk, then by ( II. 7p . 

(4,8) <i+i = vfA + i) ^ i = v^ + ^ = y:-+E3- 



. TTifc / ^-^ Vdh d ^-^ Vflu ^-^ d 

k=\ ^ ^ k=l fc=l e=i 

Now apply Lemma 14.31 with - = 1 and n = 2r — 1: there are only finitely many 
expressions of 1 as a sum of 2r — 1 unit fractions, of which only a subset satisfy the 
additional restrictions of (14. 8p . D 

It is not hard to work out that for r = 2, there are three neat canonical forms: 
{d, ei, 62) = (3, 1, 1), (4, 2, 1) and (6, 3, 2). The first is Theorem II. l( i) with d = 3, the 
second is Corollary 1 1 . 71 with d = 4 (see Theorem 14.61 below) . and the third is Theorem 
11.51 When r = 3, there are twenty-two neat canonical forms. 

Let s{d) denote the number of neat Sylvester- type canonical forms of degree d. 
Suppose Cfc = e for all k in one of these. Then e | d and, by (II. 7p . r(e -|- 1) = (i-|- 1, so 
(e -|- 1) I (d+l). Since d = (mod e) and d = —1 (mod (e -f 1)), it follows from the 
Chinese Remainder Theorem that d = e (mod e(e + 1)); that is, rf = e + ue{e + 1), 
M > 1, so that e < y/d. 



^7V 



Theorem 4.5. Let S{N) := ^'^^-^ s{d) . Then S{N) = N + 0(A^V2) ^^^ ^^^^ ^^^^ 



00. 



^e^+2e 



rf,e^+e ' 



N -e 
_e2 + e_ 


T 
e=l 


N-e 
_e2 + e_ 



Proof. The generating function for the sequence {s{d)) is 

00 00 00 00 

(4.9) E^(^)^' = EE^^^"^^^^'^ = Et 

n=l e=l u=l e=l 

Let T = [A^i/2j ^ j^ follows from (jH]) that 

Af 00 

(4.10) S{N) = J2sn = Yl 

n=l e=l 

Thus, using the telescoping sum for J2 e(e+i) ' (14-lOP implies that 
(4 11) ^ e^ + e ^ e^ + e ^ e + 1 

\ / e=l e=l e=l 

< A^(l - 2^) - log T + 0(1) = iV - N^^^ + 0{hgN). 

The lower bound is the same, minus T, so (14.111) implies that >S'(A^) = N + 0{N^^'^). 
Now, s{d) counts the number of e < d so that e divides d and e + 1 divides d+1. If 
d = 2'' - 1, then e -fl | 2^ implies that e + 1 = 2* for some t <r. But 2* - 1 | 2^ - 1 if 
and only if t | r, hence s(2'' — 1) = d{r) — 1, where d{n) denotes the divisor function. 
In particular, s(2^ — 1) = t, so the sequence {s{d)) is unbounded. More generally, 
ii e \ d and e + 1 | d + 1, then e \ d"^ + 2d and e + 1 | d"^ + 2d + 1, and since e = d 
contributes to the count in s((i^ + 2d) but not in s{d), s{d'^ + 2(i) > s((i) + 1. D 
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Half of the neat Sylvester forms come from Theorem ll.l( i). another sixth come 
from Corollary 14 .2( 11). etc. The smallest d for which s{d) = 2 is d = 15: (e,r) = 
(1,8), (3,4), so a general binary form of degree 15 is a sum of eight linear forms to 
the 15th power, or four cubics to the 5th power. The smallest d for which s{d) = 3 
is d = 99: (e,r) = (1, 50), (3, 25), (9, 10). For d < 10^ the largest value of s(rf) is 
s(7316000) = 12. Note that 2^" - 1 = 2^^^^ -1 ^ 1.04 x lO^^sa^ g^ ^^le examples 
given in the proof are not likely to describe the fastest growth. We conjecture as well 
that {s{d)} has an underlying distribution. 

If the degree d is prime, then Theorem 14.11 accounts for all canonical forms in 
Theorem 11.61 The smallest d which is not covered by Theorem 14.11 is then d = A, 
and there are two such cases, one of which is neat: ei = 2, 62 = 1, m = and 
ei = 2,m = 2. Both can be discussed constructively. 

Theorem 4.6. A general binary quartic p G H^IC'^) can be written as 

(4.12) p{x, y) = (tix^ + t^xy + t^y^f + [t^x + t^yf 

in six different ways. Further, the set {^} is the image of the set {0, cx), 1, — l,i, —i} 
under a Mobius transformation. 

Proof. By Theorem 12. 4[ if p is a general binary quartic, then there exist Cj, A so that 
p{cix + C2y, C3X + c^y) = px{x, y) := x^ + 6Ax^y^ + y^. If (I4.12p holds for p\, then 

l = tl + tl, = 2tit2 + 4t^t5, QX = 2tit3 + tl + Qtltl, 

= 2t2t3 + 4t4t^, l = tl + tl 

First suppose that t^ = 0. Then (I4.13P implies that 1 = t1 and = 2tit2, so ti = 1 
(without loss of generality) and ^2 = 0. The remaining equations imply that t^ = 3A 
and tg = 1 — 9A^. A similar argument works if t^ = 0, giving two representations: 

(4.14) px{x, y) = (x^ + 3A2/2)2 + (1 _ Q\^)y^ = (sAa;^ + y^f + (1 - gA^)^^ 

Now suppose ^4^5 7^ 0, so tit2i3 7^ and so 

h _ — 2t2t3 _ 4t4t5 _ tg ^^ 1 — ^3 _h _h _. .2 _ ^2 
ti —2tit2 4:tJ:^ t^ 1 ~ ^1 ^4 ^1 

It follows that is = i'^t^ and t^ = (— l)'^ti, and (I4.13P can be completely solved: 

tl = i-tl, t2 = 2i'{ti~t^'), 2 + 6(-l)'=A = 4tr'. 
After some massaging of the algebra, this gives four representations: 

PA(a:, y) = (^^^^) {^' - ^^^3A - {-lf)xy + (-1)^^)^ 

+ ( 3A + (-l)0 ^" + ^'^) ' ^ = 0,1,2,3. 

In order to find the six representations of p as (I4.12p . we start with the six rep- 
resentations of p\ given in (I4.14p and (14.150 . in which t/^x + t^y is a multiple of 
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d ei,...,er m F{d]e) Source 

d iL^-J or 1 1 Theorem ll.il 

d V d+l-2r 1 Theorem O 

4 2,1 6 Theorem SS] 

4 2 2 2 Theorem O 



6 3,2 40 [121113] 

6 2,1^ 22 Experiment 

6 3,1 1 14 Experiment 

6 2^ 1 9 Experiment 

6 2,1 2 12 Experiment 

6 3 3 5 Experiment 

6 2 4 5 Experiment 

8 2,1^ 62 Experiment 

10 2,1^ 147 Experiment 

12 2,1^ 308 Experiment 

2s 2,1^-1 2('+^) - {'f) Conjecture 

Table 1. Values of F{d; e) 



one of the six hnear forms x,y,x + i^y. Apply the the inverse of the map (x, y) i— ;■ 
{ciX+C2y, C3X+C4?/), which takes t^x+t^y to a multiple oiti{ciX—C2y)+t5{—C3X+Ciy): 
I ^ G(|), where G{z) = |^. D 

Theorem 4.7. Given two fixed distinct binary linear forms ii,i2, « general binary 
quartic in H^i^?) has two representations as 

(4.16) p(x, y) = (tix^ + t2xy + t^y'^f + td\ + t^i 

Proof. Given p, £i,£2; rnake an invertible linear change of variable taking (^1,^2) '^ 
{x,y), and suppose p{x,y) ^-)■ q{x,y) = ^^ ajx"^"*?/*. Then q has the shape fl4.16p if 
and only if the coefficients of x^y, x'^y'^, xy^ in [tix"^ + t2xy + t3y'^)'^ and q agree. Thus, 
we seek to solve the system 

(4.17) ai = 2tit2, a2 = 2tit^ + t^, 03 = 2t2t3. 

But ( I4.17P implies ait\ — 202^1^2 + 203^^ = 0, hence in general, there are exactly two 
values of /3 so that ^2 = Pti'-, in each case, ^1 = fg- The two choices of sign for ti lead 
to the same square, and ^3 = — ti, so (14.171) has these two solutions. D 

In the case of Theorem 11.61 let F{d\ ei, . . . , e^) denote the number of different rep- 
resentations that a general p G Hd{C^) has, by our convention. We present in Table 
1 a complete list of proved or conjectural values when d < Q, reflecting numerical 
experiments on Mathematica. (Recall that if d is prime, then Theorem 14. II presents 
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all possible canonical forms of this type.) The conjectural value of F{2s; 2, V^^) is 
suggested by the given data for 2 < s < 6 and 0EIS[21 A081282]. 

Varilly-Alvarado, in [l2ll33], constructs explicitly all 240 representations oi x^ + y^ 
as P + g'^'i he considers forms multiplied by roots of unity as different, which explains 
the appearance of ^ in the table above. This is also proved to be the number of 
representations for a general sextic. 

To describe the experiments for -F(2s; 2, 1'^^^) more precisely, we generate a form 



p{x,y) = Y,[l) 



akx"^' ''y\ 



where ak = t + iu for random integers t, u in [—100, 100]. In case ?7i < 2, we assume 
a change of variables so that the fixed linear forms are x'' or y''; for ?ti > 2 we choose 
additional linear forms with random coefficients. Let /i(x, y) = Ux"^ + Vxy + Wy^ 
for variables {U,V,W) and let q{x,y) = p{x,y) — h'^{x,y), and apply Sylvester's 
Algorithm to q. That is, we construct the (s + 2) x s matrix As^i{q), with polynomial 
entries in {U, V, W) of degree s and require that it have rank < s. This is done by 
counting the number of [U, V, W) which are common zeros of all s x s minors. This 
number is divided by s to account for h^ = [Cgh)''^. As a back of the envelope 
calculation, one might take the first s — 1 rows of As-i and use the cofactors to 
compute a non-trivial null-vector. Ignoring possible cancellation, the components 
would be polynomials of degree s{s — 1) in {U, V, W). Taking the dot product with 
the last three rows of As_i gives three polynomials of degree s^. Ignoring cancellations 
and multiplicity, there should be (s^)^ common zeros, and dividing by s gives an upper 
bound for F(2s; 2, 1*~^) of s^. The conjectural value is asymptotically ^s^, which at 
least has the same order of growth. 

5. Quadratic forms and sums of squares 

We begin this section with a constructive proof of Theorem 11.21 which will serve as 
a template for constructive proofs involving cubic forms. 

Constructive Proof of Theorem M.^ Suppose p G if2(C"), and specifically, 

n 
P\X\^ . . . , Xn) / ^ OjUX^ + Z, y ^ Qj^jXiXj . 

1=1 l<*<i<" 

Then g^ = 2 ^"^-^ a^Xj, Since an 7^ in general, we can define g(xi,...,Xn) = 

p{xi, ...,Xn)- ^(Z]"=i "ija^i)^ observe that ^ = 0, so g = g(x2, ...,Xn). Iterating 
this argument gives the construction. There is only one linear form ±i so that 
g^ = 2i-^, so the representation is unique. D 



As we saw in the proof of Theorem 15.21 constant-counting for sums of squares 
is complicated by the action of the orthogonal group on a sum of t squares. If 
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M e MattiC) and MM* = /, then 



t 



2 



When t = 2, choose 9 E C and let e*^ = cos 9 + i sin 9 := (m, v), so that 

(5.1) /2 + ^2^(^^_^^)2^(^^^^^)2_ 

This means that we may safely remove one monomial from one of the summands. 
Theorem 5.1. A general binary form p G if2s(C^) can be written as 

(5.2) (j2^kx'''y'] + [i^^s+kx^-v 

\k=0 / \k=l 

m ( '^^ ) different ways. 

Proof. The non-constructive proof is a simple application of Corollary 12.31 Writing 
(15.21) as /^ + g^ gives the partials with respect to the tj's as 

{2x'"V/, < A; < s} U {2x'-''y''g, I < k < s} ; 

specializing to f = x^ and g = y"^ above gives all monomials in if2s(C^). 
The more obvious expression 

(5.3) p{x, y) = f (x, y) + /(x, y), g,h e H^iC^) 

is not a canonical form, because 2(s + l) > 2s + 1. However, every sum of two squares 
can be formally factored, and these behave nicely with respect to (15. ip . 

f + 9' = {f + ^9)if-^9) ^^ 
[uf + vgf + {vf - ugf = {e'\f + ig)) {e''\f - ig)) . 

Suppose p(l, 0) = ao 7^ (true for general p) and (15.31) holds, where /(I, 0) = p and 
g{l, 0) = r. Then p^ + t"^ = oq, so that - j^ ±i and the coefficient of x* in vf + ug 
will be vp + UT = sin9p + cos 9t, which is zero exactly when tan^ = — -. Thus for 
precisely one value of tan 6', the right-hand side of (15. ip will be in the form (15. 2p . 
This determines a pair (±u, ±f ); however, the squares in (15. 2p will be the same. 

In other words, each distinct factorization of p (up to multiple) as a product of 
two s-ic forms, when combined with the orthogonal action of (15. ip . yields exactly 
one representation as (15.21) . A general p G H2s{C^) is a product of 2s distinct linear 
factors; these can be organized into an unordered pair of products of s distinct linear 
factors in |( j^) = ( ^7 ) "ways. □ 



The "lost" degree of freedom in a sum of squares never arises in Theorem 11.6 
because 2(| + l)>(i+l. The missing monomial x* in the second summand of (15. 2p 
may be replaced by any specified x*"'^"?/^" by a similar argument. 
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Another classical result is that a general ternary quartic is a sum of three squares 
of quadratic forms, generally in 63 different ways up to the action of the orthogonal 
group (see [28].) Hilbert proved that every (not just a general) positive semidefinite 
real p G H^lM.^) is a sum of three squares from H2{M.^). A constructive discussion 
has recently been given in papers by Powers and the author [27j, Powers, Scheider, 
Sottile and the author [28], Pfister and Scheiderer [25] and Plaumann, Sturmfels and 
Vinzant [26]. A non-constructive proof (without the count) can easily be given. 

Theorem 5.2. A general ternary quartic p G H^IC^) can be written as p = qf + q2 + 
g|, where qj G H2{C^). 

Proof. We take g^'s so that the monomial x^ only appears in gi and the mono- 
mial y^ only appears in gi and q2, and so the number of coefficients in the g^-'s is 
6 + 5 + 4 = 15. Taking the partials where (gi,g2,g3) = (a^^,l/^,^^) shows that J 
contains 2x^{x^, 7/^, z^, xy, xz, yz}, 2y'^{y'^, z"^, xy, xz, yz} and 2z'^{z'^, xy, xz, yz}, and 
so is equal to H4{C^). D 

Since 3(™^^) — 3 < (^™^^) for m > 3, this result does not generalize to ternary 
forms of higher even degree. 

The situation is somewhat simpler over M. A real version of Theorem 15.11 appears in 
|34j . Up is real and positive definite and p = f'^ + g'^, where / and g are also real, then 
the factors of p consist of s conjugate pairs. In the factorization p = {f + ig){f — ig), 
the pairs must be split between the conjugate factors, and if p has distinct factors, 
this can be done in 2^^^ different ways. A real generalization of Theorem 15.21 appears 
in [H Cor. 2. 12]. Suppose a real psd form p G if2s(K'^) is a sum of t squares and 
2;ft g Hs(W^), 1 < i < t, is given. Then there is a representation p = Yli=i9h i^ 
which x^' does not occur in gj for j > i. This argument can also be applied to a 
general sum of t squares over C, but it is not universal, and depends on the psd 
condition: if xy = {ax + by)"^ + {ex + dyY, then abed ^ 0. 



6. Cubic forms 

In this section, we present three representations for forms in H^{'C") as a sum of 
cubes of linear forms. The first two are canonical; the third isn't, but it is universal. 

We begin with Theorem 11.31 which first appeared [3D] in a 1987 paper of Boris 
Reichstein. At the time of this writing, [50] has had no citations in MathSciNet. (It 
was discussed in [32] and, from there, in [0]. The former was never submitted for 
publication and the latter appeared in an unindexed journal.) The original presenta- 
tion and proof in [30] were given for trilinear forms (see §2); the theorem is applied 
to cubic forms there mainly in the examples. 

By iterating (11.51) . we obtain a canonical form for p G i/3(C"), see [30^, p. 98]. 
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Corollary 6.1. A general n-ary cubic p G H^i^C^) can be written uniquely as 

L(n-1)/2J n-2m 

(6.1) p{Xi, ...,Xn)= Yl Yl (^ml+2mXl+2m + " " " + 4^^")^ 

m=0 k=l 

for some t^^j G C. 

This gives p as a sum of n+(n— 2) + - ■ ■ = \_ ^ ^ \ cubes. Recall that by Alexander- 
Hirschowitz, for n 7^ 5, a general cubic form in n variables can be written as a sum 
of [" '""*" g "*" ' ~\ cubes. Thus (16. ip represents a general cubic as a sum of about 50% 
more cubes than the true minimum; this is due to the large number of linear forms 
with restricted sets of variables. 

Reichstein's proof of Theorem 11.31 requires the well-known "generalized eigenvalue 
problem" for pairs of symmetric matrices, as interpreted for quadratic forms: if a 
general pair of quadratic forms /, (? G H2{C"') is given, then there exist n linearly 
independent forms Lj(x) = X]?=i c^j^j and q G C so that 

n n 

(6.2) /=E^'' 9 = Y.^^^- 

i=l i=l 

We may also assume that the coefficients aij are generally non-zero; c.f. Corollary 



Proof of Theorem \1.3[ For general p G H^IC"), we simultaneously diagonalize / 
1^ and (7 = 1^ 



—^ and g = ^ as in (16. 2p . Since mixed partials are equal. 



11 



(6.3) ^ = ^ = E2«-^^ = E2^^"-^ 

and since the Lj's are linearly independent, (16. 3p implies that q;j2 = Cian. 
It is generally true that an 7^ 0. Let 

n 



g(xi, ...,Xn)= p{xi, . . . , x„) - E sk^^^i- 



i=l 



It follows that 



n 

2 



dq _ dp_ sr-^ San ^ _ _^ _ \^ r 

dxi ~ dxi ^ 3aii ' ~ dxi ^ ' "' 

4 = 1 1=1 

dq _ dp ^3ai2 2_ 9p v^ .2 _ n 

1=1 J=l 

Since ^ = ^ = 0, we have q = q{x3, . . . , x„). D 

We now give a constructive proof of Theorem II. 4[ which gives a different canonical 
form for Hs{C^) requiring even more cubes. 
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Proof of Theorem \1.4[ The constant-counting makes this a potential canonical form: 
the variables are t{j,j},A; with l<i<j<k<n, and there are ("J" ) = N(n, 3) 
such triples (i, j, k). Given p G i/3(C"), g^ is a quadratic form, so we can generally 
complete the square by Theorem ll.2t 

— — = 2_^\'^3J^3 + ■ ■ ■ + tjnXn) ■ 

Then tj„ 7^ for general p and if we let 

n 
(JyXi, • • • ) 3JnJ P\Xi, • • • ) 2^?iJ / ^ 3i \ jj"^] ~r ' ' ' ~r <^jnXn) ; 

i=i 
then ^ = 0, so g = g(xi, . . . , x„_i). Iterate this construction to get fll.6p . D 

It is not hard to give nonconstructive proofs of Theorems 11.31 and 11.41 using Corol- 
lary 12.31 These are left for the reader. 

We first presented this next construction in [32j; an outline of the proof can be 
found in [6j. This is not a canonical form, but is included here because it gives an 
absolute upper bound for the length of cubic forms. 

Theorem 6.2. If p & H^IC"'), then there exists an invertible linear change of vari- 
ables Hj = Yl ^jkXk o-nd n linear forms Ij so that 

n 
(6.4) p{xi,...,Xn) = ^fj{Xi,...,Xn) + q{y2, ■ ■ ■ , Vn)- 

i=i 
Thus every cubic in n variables is a sum of at most ("^ ) cubes of linear forms. 

Proof. Define linear forms ij^m{y) for 1 < J < '^ + 1 by 

m 

^j,miyu • • • , 2/n) = %■ + a ^ yj, l<j<m, 
(6.5) 



„ , N /-, N V^ -(m + 1) + y/m + 1 

i^m+l,m[yi,---,yn> = -(1 + ma) } Vj, « = 7 — T • 

^^-^ mim + 1 

i=i 

Then it can be easily checked that 

m+l m+1 m 

(6.6) ^£^.^(^) = and J^ ^?, Jy) = J] 2/^. 

i=l 3 = 1 k=l 

Suppose 7^ p G H^i^C""). Use Biermann's Theorem to find a point u where 
p{u) 7^ 0, and after an invertible linear change of variables, taking {xj} t— )■ {uj}, we 
may assume that p(l, 0, . . . , 0) = 1 and so 

(6.7) p = ul + 3hi{u2, . . . , Un)ul + 3/12(^2, • • • , Un)Ui + hs{u2, ...,Un), 
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where deg hj = j. Now let ui = yi — hi{u2, ■ ■ ■ ,Un) to clear the quadratic term, so 

(6.8) P= yl + 3^1/^2(^2, . . . , Mn) + hs{u2, ...,Un), 

where again deg hj = j. Diagonalize h2{u2, ■ ■ ■ ,Un) as a quadratic form into ?/| + 
■ ■ ■ + y^, where r < n, and make the accompanying change of variables. We now have 

(6.9) p = yl + 3yi{yl + --- + y^) + ks{y2,...,yn); r < n, 

where deg k^ = 3. Finally, using (16.51) and (16. 6p . we construct g, a sum of r < n 
cubes: 

1 "" 

givi, ■■■,yn) ■■= -^{yi + Vr- 4^-1(2/2, • • • , yr)) 

(6.10) = - E ^? + 7^ E ^i^^--i + 3 E y<r-i + v^E 4r-i 

j=l V j^i j^i j^i 

r 

= yl + ^yM + --- + yl) + v^E4r-i(?/2> • • • >i/r)- 

Then q := p — g is a. cubic form in (1/2, • • • , 1/n) as in (16.41) . Iteration of this argument 
shows that any cubic p G HsiC"") is a sum of at most "'■"^"^ ' cubes. D 

Theorem 11.51 can be extended to a canonical form for quartics as a sum of fourth 
powers of linear forms. Note that x„ appears in each summand of (16. ip . with, gener- 
ally, a non-zero coefficient. 



Corollary 6.3. For general p G iJ4(C"), there exist ik G ifi(C") and q G H^i 
so that, with a{n) = [ '""^ -* J, 

a{n) 

p{Xi,...,Xn) = '^ikiXi, . . . ,Xn)'^ + q{Xi, . . . ,Xn-l). 
fc=l 

As a consequence, a general p G H^i^C^) can be written as 

[(n-l)/2j n r-2m 

P[Xi, . . . , Xn) = / ^ / ^ / ^ l^m,r,l+2m-^l+2-m + " " " + tj^^^j.Xr) ■ 

m=0 T=l+2m k=l 

Proof. By Corollary 11.31 and (16. ip . for general p G i74(C"), we can write 

„ [{n-l)/2j n-2m 

3L- \^ \^ (4''} r + . . . + /{^} T ^3 

a ~ / ^ / J l''m,l+2m-^l+2m "T "T ''rn,n-^nj 

(6.11) """ ™=° '=' 



E E(^^''(^))' 



m=0 fc=l 
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As before, if g = P - Efc m -Jw^tm^ then ^ = 0, so g = g(xi, . . . , x„_i). Repeat 

as before. There are N{n, 3) coefficients in (16.111) . and since N{n, 3) + N{n — 1, 4) = 
iV(n, 4), the count is correct for a canonical form. D 

Note that there is no variable which appears in each linear form in (16. lip , so the 
argument can't be extended to quintics. For the same reason, Theorem 11.41 does 
not extend to quartics. By combining Theorems 11.31 and 16.31 we obtain canonical 
forms as a sum of powers of linear forms in the four exceptional cases of Alexander- 
Hirschowitz, of course at the expense of the number of summands. With regards 
to ternary quartics, and Theorem II. 9[ Corollary 16.31 becomes the following canonical 
form for H^iC"^) as a sum of seven fourth powers. 

3 2 

'^{tklXl + tk2X2 + tfcsXs)^ + tioXg + y^XujiXi + U12X2Y + U^xl- 
fc=l i=l 

There is an arithmetic obstruction to a "Reichstein-type" canonical form for quar- 
tics; that is, one in which each linear form is allowed to involve each variable. If 

r 

(6.12) p{xi, . . . , Xn) = ^(afclXi H h aknXnY + (i{^i^ • • • 5 Xm)- 

k=l 

were a canonical form for some n, then we would have A^(n, 4) = rn + N{m,4). 
However, for n = 12, there does not exist m < 12 so that 12 | (4) — ("^''^ ), so no 
such canonical form can exist. More generally, let 

(6.13) A,= {n:0<m<n ^ nj( ("+,^-i) - (™+,^-i) } 

denote the set of n for which this argument rules out Reichstein-type canonical forms. 
We present without proof a number of results about A^. Note that there is no obstacle 
for (16.121) in prime degree, such as c? = 2, 3. 

Theorem 6.4. 

(i) If3 )(k, then n = 2^'' ■ 3 e A^. 

(a) If p = 1 (mod 144) is prime, then 12p e A4. 

(Hi) If p is prime, then p \ (^"+p~i^ _ (^"^^ hence Ap = ^ for prime p . 

(iv) The smallest elements 0/ Ag, Ag, Aio, A12, v4i4 and Ai^ are 10, 1792, 6, 2^2, 
338 and 273 respectively. If Ag or Aiq are non-empty, then their smallest elements 
are at least 10^. 



7. SUBSPACE CANONICAL FORMS 

One natural generalization of the definition of canonical forms is to consider maps 
F : X I— )■ ifrf(C") where X C C^ is an A^(n, (i)-dimensional subspace of C*^. These 
can be analyzed completely in the simplest non-trivial case: M = 4, A^(2, 2) = 3. 
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Proof of Theorem \l.l(K Assume that some Cj 7^ 0. Without loss of generahty, we 
may assume that C4 7^ and divide through by C4 so that the equation is t^ = 
Oiti + 02^2 + flats, where Oj = — Cj/c4 for i = 1,2, 3. Then fll.lSp can be parameterized 
as a map from C^ 1— )■ H2{'C^) as: 

(7.1) F{t; x) = {hx + t2yf + {hx + (aiti + 02^2 + a^h)yf. 
The partials with respect to the tj's are: 

2x{tix + t2y) + 2aiy{tj,x + (aiti + 02^2 + 0-3^3)1/), 

(7.2) 2y(tix + t2y) + 2a2y{hx + (aiti + 02^2 + a^h)y), 

2{x + a^y){hx + (aiti + 02^2 + a''ih)y)- 

Now, (17.11) is a canonical form if and only if there exists a choice of tj so that the 
three quadratics in (17.21) span H2{C^). A computation shows that the determinant 
of the forms in (17.21) with respect to the basis {x'^,xy,y'^} is the cubic 

(7.3) - 8((aia2 - a3)ti + (1 + al)t2 + (0203 + ai)t3){aitl + a2tit2 + a^tih - t2h). 

The second factor in (17.31) always has the term —^2^3 and so never vanishes, hence 
this determinant is not identically zero (and (17. ip is a canonical form), unless 

(7.4) aia2 — a^^ = 1 + a\ = 0203 + ai = 0. 

In the exceptional case where (17. 4p holds, then 02 = e, where e = ±i, and a^ = eai. 
Evaluating (17. ip at {x,y) = (ai,e) yields 

(aiti + et2)^ + (aits + eaiti + e^t2 + £^^1^3)^ 

= (aiti + et2)^ + ((1 + e^)ait3 + eai^i + eh2f = (aiti + et2f + e^(aiti + et2)^ = 0, 

as claimed. D 

It would be interesting to know how this generalizes in higher degrees. 

Conjecture 11.111 is true for degree 2 by Theorem I l.lOi We have verified it for even 
degrees up to eight by Corollary 12.31 applied to random choices for aj,f3j in (I1.14p . 
We hold some hope that generalizations such as Corollary 1 1 . 1 II will have applications 
in more than two variables as well. 
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